DGX Software Stack

您所在的位置:网站首页 nvidia modeset DGX Software Stack

DGX Software Stack

#DGX Software Stack| 来源: 网络整理| 查看: 265

The following is a list of packages installed as part of the DGX Software Stack, broken out by metapackage name and platform.

DGX A100

dgx-a100-system-configurations

dgx-release

nv-cpu-governor

nv-hugepage

nv-iommu-pt

nv-ipmi-devintf

nv-limits

nv-update-disable

nvgpu-services-list

nvidia-acs-disable

nvidia-crashdump

nvidia-esm-hook-epilogue

nvidia-fs-loader

nvidia-kernel-defaults

nvidia-nvme-smartd

nvidia-pci-bridge-power

nvidia-redfish-config

nvidia-relaxed-ordering-gpu

nvidia-relaxed-ordering-nvme

dgx-a100-system-tools

dgx-release

ipmitool

nv-common-apis

nv-env-paths

nvidia-mig-manager

nvidia-raid-config

nvme-cli

tpm2-tools

dgx-a100-system-tools-extra

msecli

DGX-2

dgx2-system-configurations

dgx-release

nv-cpu-governor

nv-hugepage

nv-iommu-pt

nv-ipmi-devintf

nv-limits

nv-update-disable

nvgpu-services-list

nvidia-acs-disable

nvidia-crashdump

nvidia-esm-hook-epilogue

nvidia-fs-loader

nvidia-kernel-defaults

nvidia-nvme-smartd

nvidia-pci-bridge-power

nvidia-redfish-config

nvidia-relaxed-ordering-gpu

nvidia-relaxed-ordering-nvme

dgx2-system-tools

dgx-release

ipmitool

nv-common-apis

nv-env-paths

nvidia-raid-config

nvme-cli

tpm-tools

dgx2-system-tools-extra

msecli

DGX-1

dgx1-system-configurations

dgx-release

nv-ast-modeset

nv-cpu-governor

nv-hugepage

nv-ipmi-devintf

nv-limits

nv-update-disable

nvgpu-services-list

nvidia-crashdump

nvidia-esm-hook-epilogue

nvidia-fs-loader

nvidia-kernel-defaults

nvidia-pci-bridge-power

dgx1-system-tools

dgx-release

ipmitool

nv-common-apis

nv-env-paths

dgx1-system-tools-extra

nvidia-raid-config

storcli

DGX Station

dgxstation-system-configurations

dgx-release

nv-hugepage

nv-limits

nv-update-disable

nvgpu-services-list

nvidia-crashdump

nvidia-esm-hook-epilogue

nvidia-fs-loader

nvidia-kernel-defaults

dgxstation-system-tools

dgx-release

nv-common-apis

nv-env-paths

nvidia-raid-config

DGX Station A100

dgxstation-a100-system-configurations

dgx-release

nv-cpu-governor

nv-hugepage

nv-iommu-pt

nv-ipmi-devintf

nv-limits

nv-update-disable

nvgpu-services-list

nvidia-crashdump

nvidia-esm-hook-epilogue

nvidia-fs-loader

nvidia-kernel-defaults

nvidia-nvme-smartd

nvidia-pci-bridge-power

nvidia-redfish-config

nvidia-relaxed-ordering-gpu

nvidia-relaxed-ordering-nvme

dgxstation-a100-system-tools

dgx-release

ipmitool

nv-common-apis

nv-env-paths

nvidia-mig-manager

nvidia-raid-config

nvme-cli

tpm2-tools

dgxstation-a100-system-tools-extra

msecli

The folowing packages are installed by the nvidia-mlnx-ofed-misc metapackage:

mlnx-fw-updater

mlnx-pxe-setup

nvidia-mlnx-config

nvidia-peermem-loader

The following additional packages are part of the DGX Software Stack:

nv-docker-options

nvidia-logrotate

nvidia-motd

nvidia-ipmisol

The following table lists all packages that will be installed as part of the system configuration package with more details:

Package

Description

1

2

A

dgx-release

Release information

R

R

R

nv-ast-modeset

Disable the Aspeed display driver. It can cause issues with connected monitors. The AST2xxx is the BMC used in our servers.

[DGX-1, DGX-2, DGX A100, DGX Station A100]

R

R

R

nv-enable-nvme-hot-plug

Configure kernel parameters for NVMe hot plug (see also kernel section below).

R

nv-hugepage

Sets the “transpa rent_hugepa ge=madvise” kernel parameter.

R

R

R

nv-iommu-pt

Sets iommu=pt for AMD Rome platforms.

R

nv-ipmi-devintf

Add the i pmi_devintf module for accessing the BMC using the ipmi tool.

R

R

R

nv-limits

Increase the process resource limits for users (ulimits nofile 50000)

R

R

R

nv-update-disable

Disable automatic system upgrades. Users need to explicitly upgrade their systems using apt.

R

R

R

nvgpu-services-list

Lists GP U-consuming services in .json format, such as DCGM or NVSM, and required by the firmware update mechanism.

R

R

R

nvidia-acs-disable

Disables the PCIe ACS capability to allow for better GPU- direct performance in bare-metal use cases on DGX A100.

R

nvidia-crashdump

Tools to manage kernel crash dumps. They are disabled by default.

R

R

R

nv-docker-options

Increases SHMEM and other resources.

R

R

R

nvidia-ipmisol

[optional]

Enables serial output through the BMC

(SOL - Serial over Lan)

O

O

O

nvidia-kernel-defaults

Disable ARP for security i mprovements ne t.ipv4.conf

.all.a rp_announce = 2

.all .arp_ignore = 1

.default.a rp_announce = 2

.default .arp_ignore = 1

R

R

R

nvidia-logrotate

Modify the logrotate co nfiguration

O

O

O

nvidia-motd

Modify message -of-the-day (MOTD) to display NVSM health monitoring alerts and release i nformation.

O

O

O

nvidia-nvme-smartd

Enables SMART monitoring on NVME devices. By default, smartd will skip NVME devices.

R

R

nvidia-pci-bridge-power

Sets the bridge power control setting to “on” for all PCI bridges.

R

R

R

nvidia-relaxed-ordering-gpu

Sets a reg-key to enable PCIe relax ed-ordering in the GPUs

R

nvidia-relaxed-ordering-nvme

Installs a script that users can call to enable re laxed-order in NVME devices.

R

nvidia-redfish-config

Configures the redfish interface with an interface name and IP address. The interface name is “bmc _redfish0”, while the IP address is read from DMI type 42.

R

Legend:

1

DGX-1

2

DGX-2

A

DGX A100

R

Required package

O

Optional package



【本文地址】


今日新闻


推荐新闻


CopyRight 2018-2019 办公设备维修网 版权所有 豫ICP备15022753号-3